Outlier Robust Online Learning
Authors
Abstract
We consider the problem of learning from noisy data in practical settings where the data set is too large to store on a single machine. More challengingly, data coming from the wild may contain malicious outliers. To address these scalability and robustness issues, we present an online robust learning (ORL) approach. ORL is simple to implement and has a provable robustness guarantee, in stark contrast to existing online learning approaches, which are generally fragile to outliers. We specialize the ORL approach to two concrete cases: online robust principal component analysis and online linear regression. We demonstrate the efficiency and robustness advantages of ORL through comprehensive simulations and through predicting image tags on a large-scale data set. We also discuss an extension of ORL to distributed learning and provide experimental evaluations.
Similar resources
Simulation of Scour Pattern Around Cross-Vane Structures Using Outlier Robust Extreme Learning Machine
In this research, the scour hole depth at the downstream of cross-vane structures with different shapes (i.e., J, I, U, and W) was simulated utilizing a modern artificial intelligence method entitled "Outlier Robust Extreme Learning Machine (ORELM)". The observational data were divided into two groups: training (70%) and test (30%). Then, using the input parameters including the ratio of the st...
Learning-by-exporting versus self-selection: New evidence for 19 sub-Saharan African countries
Article history: received 19 August 2014; received in revised form 9 September 2014; accepted 11 September 2014; available online 21 September 2014. We examine learning-by-exporting effects of manufacturing and services firms in 19 sub-Saharan African countries. Comparing several outlier-robust estimators, our results provide evidence for positive effects in the manufacturing sector when using the ...
Application of Outlier Robust Nonlinear Mixed Effect Estimation in Examining the Effect of Phenylephrine in Rat Corpus Cavernosum
Ignoring two main characteristics of concentration-response data, namely correlation between observations and the presence of outliers, may lead to misleading results. Therefore, a special method should be considered. In this paper, outlier-robust nonlinear mixed estimation is used to examine the effect of phenylephrine in rat corpus cavernosum. In this study, eight different doses of phenylephrin...
Simultaneous robust estimation of multi-response surfaces in the presence of outliers
A robust approach should be considered when estimating regression coefficients in multi-response problems. Many models are derived from the least squares method. Because the presence of outlier data is unavoidable in most real cases and because the least squares method is sensitive to these types of points, robust regression approaches appear to be a more reliable and suitable method for addres...
An Online Outlier Detection Technique for Wireless Sensor Networks
We propose an online and local outlier detection technique with low resource consumption based on an unsupervised centered quarter-sphere support vector machine for wireless sensor networks. Using synthetic data, we demonstrate that our technique achieves better mining performance in terms of parameter selection using different kernel functions compared to an earlier offline outlier detection t...
Journal title:
- CoRR
Volume: abs/1701.00251
Publication date: 2017